Referential integrity quality metrics

نویسندگان

  • Carlos Ordonez
  • Javier García-García
چکیده

Referential integrity is an essential global constraint in a relational database, that maintains it in a complete and consistent state. In this work, we assume the database may violate referential integrity and relations may be denormalized. We propose a set of quality metrics, defined at four granularity levels: database, relation, attribute and value, that measure referential completeness and consistency. Quality metrics are efficiently computed with standard SQL queries, that incorporate two query optimizations: left outer joins on foreign keys and early foreign key grouping. Experiments evaluate our proposed metrics and SQL query optimizations on real and synthetic databases, showing they can help detecting and explaining referential errors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Referential Integrity Browser for Distributed Databases

We demonstrate a program that can inspect a distributed relational database on the Internet to discover and quantify referential integrity issues for integration purposes. The program computes data quality metrics for referential integrity at four granularity levels: database, table, column and value, going from a global to a detailed view, exhibiting specific evidence about referential errors....

متن کامل

Investigation of Application Specific Metrics to Data Quality Assessment

Databases have risen to be one of the most important corporate assets, but usually their data quality is poor or even not manageable at all. Several metrics of data quality have been designed and implemented to monitor a database of an information system. The primary goal of data quality metrics design was to provide the managers of information centres the tools for monitoring of their database...

متن کامل

Referential Integrity Is Important For Databases

Referential integrity is a database constraint that ensures that references between data are indeed valid and intact. Referential integrity is a fundamental principle of database theory and arises from the notion that a database should not only store data, but should actively seek to ensure its quality. Here are some additional definitions that we found on the Web. • “Referential integrity in a...

متن کامل

Index Design for Enforcing Partial Referential Integrity Efficiently

Referential integrity is fundamental for data processing and data quality. The SQL standard proposes di↵erent semantics under which referential integrity can be enforced in practice. Under simple semantics, only total foreign key values must be matched by some referenced key values. Under partial semantics, total and partial foreign key values must be matched by some referenced key values. Supp...

متن کامل

A Study of Quality Assessment Techniques For Fused Images

290 Abstract: Critical image processing tasks can be efficiently executed by fusion of images taken from range of distributed sensors. Advancements in digital image processing and communication technology with invent of new sensors experiencing the excessive need of effective image quality assessment of image fusion techniques. Various metrics have been discussed for quality measurement of fuse...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Decision Support Systems

دوره 44  شماره 

صفحات  -

تاریخ انتشار 2008